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Abstract 

We consider the problem of testing multiple quantum hypotheses 
, p®"}, where an arbitrary prior distribution is given and each 
of the r hypotheses is n copies of a quantum state. It is known that the 
minimal average error probability decays exponentially to zero, that 
is, Pe = exp{—+ o(n)}. However, this error exponent ^ is generally 
unknown, except for the case that r = 2. 

In this paper, we solve the long-standing open problem of iden¬ 
tifying the above error exponent, by proving Nussbaum and Szkola’s 
conjecture that ^ C{pi, pj). The right-hand side of this equal¬ 

ity is called the multiple quantum Chernoff distance, and C{pi,pj) := 
maxo<s<i{—logTrpfpj”®} has been previously identihed as the opti¬ 
mal error exponent for testing two hypotheses, pf" versus p®”. 

The main ingredient of our proof is a new upper bound for the 
average error probability, for testing an ensemble of finite-dimensional, 
but otherwise general, quantum states. This upper bound, up to a 
states-dependent factor, matches the multiple-state generalization of 
Nussbaum and Szkola’s lower bound. Specialized to the case r = 2, we 
give an alternative proof to the achievability of the binary-hypothesis 
Chernoff distance, which was originally proved by Audenaert et al. 
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1 Introduction 


A basic problem in information theory and statistics, is to test a system 
that may be prepared in one of r random states. Treated in the framework 
of quantum mechanics, the testing is performed via quantum measurement, 
and the physical states are described by density matrices 001 , 002 , ■■■ ,ujr, 
namely, positive semidefinite Hermitian matrices of trace 1. It is a notable 
fact that, when ooi’s commute, the problem reduces to classical statistical 
testing, among r probability distributions that are given by the arrays of 
eigenvalues of each of the density matrices. However, the generally noncom- 
mutative feature makes quantum statistics much richer than its classical 
counterpart. 

Our main focus in the current paper will be on the asymptotic setting. 
Let the tensor product state p®"" denotes n independent copies of p, in 
analogy to the probability distribution of i.i.d. random variables. We are 
interested in the asymptotic behavior of the average error P^, in discrimi¬ 
nating a set of quantum states {pf ”, • • •, P®”}, when an arbitrary prior that 
is independent of n is given. Parthasarathy showed that Pe decays exponen¬ 
tially, that is, Pe = exp{—^n-l-o(n)} [M]. However, to date the optimal error 
exponent as a functional of the states pi,..., pr, is generally unknown. 

Significant achievements have been made for the case of testing two 
quantum hypotheses (r = 2). In two breakthrough papers, [T] and [29], it 
has been established that the optimal error exponent in discriminating p®” 
and pf”, equals the quantum Chernoff distance 

C{pi,p 2 ) := maxj-logXrpfp^"*}. 

0<s<l 

Audenaert et al in [T] solved the achievability part, in the meantime Nuss- 
baum and Szkola in [29| proved the optimality part. This provides the 
quantum generalization of the Chernoff information as the optimal error 
exponent in classical hypotheses testing (9|; see also [TO] . 

The solution for the general cases r > 2 is still lacking and it does 
not follow from the binary case directly. The optimal tests, as analogs of 
the classical maximum likelihood decision rule, have been formulated in the 
1970s. For discriminating two states it has an explicit expression known as 
the Holevo?Helstrom test [Ml El], and indeed, the proof in [T] relies on a 
nontrivial application of this HolevoTHelstrom test. In contrast, for discrimi¬ 
nating multiple quantum states the corresponding optimal measurement can 
only be formulated in a very complicated, implicit way [201111]. Such a situ¬ 
ation illustrates the difficulty in dealing with the asymptotic error exponent, 
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for the multiple case r > 2. Intuitively, competitions among pairs make the 
problem complicated. 

Nussbaum and Szkola introduced the multiple quantum Chernoff dis¬ 
tance 

C(pi,...,p^):= niin C{pi,pj), 

and conjectured that it is the optimal asymptotic error exponent, in discrim¬ 
inating quantum states pf ^,..., pf^ [SaEIlES]. This is in full analogy to 
the existing results in classical statistical hypothesis testing [231 ESI EH EH] ■ 
Significant progress has been made towards proving this conjecture. Besides 
the case of commuting states which reduces to the classical situation, it has 
been proven to be true in several interesting special cases. These include 
when the supporting spaces of the states pi,...,pr are disjoint [32], and 
when one pair of the states is substantially closer than the other pairs, in 
Chernoff distance [28l[2]. In general, Nussbaum and Szkola showed that the 
optimal error exponent ^ in testing multiple quantum hypotheses, satisfies 
C/3 < ^ < C [32], and Audenaert and Mosonyi recently strengthened this 
bound, showing that C/2 < ^ < C 

In this paper, we shall prove the aforementioned conjecture, that is, we 
show that the long-sought error exponent in asymptotic quantum (multiple) 
state discrimination, is given by the (multiple) quantum Chernoff distance. 
Besides, as a main ingredient of the proof we derive a new upper bound 
for the optimal average error probability, for discriminating a set of finite¬ 
dimensional, but otherwise general, quantum states. This one-shot upper 
bound has the advantage that, up to a states-dependent factor, it coin¬ 
cides with a multiple-state generalization of Nussbaum and Szkola’s lower 
bound [29] . 

Before concluding this section, we review the relevant literature. Asymp¬ 
totics of statistical hypothesis testing is an important topic in statistics and 
information theory, and is especially useful in identifying basic information 
quantities. We refer the interested readers for a partial list of classical re¬ 
sults to [6] m do] [131 El EHl [23], and quantum results to [H [31 [5l [8l [TSl 
[SlEZlEHlEalEH]. The optimal or approximately optimal average error in 
quantum state discrimination, and the corresponding tests to achieve it, is 
a basic problem in quantum information theory and has attracted extensive 

study; see, for example, [21 [31 El El [231 El [221 ESI [iQl E] • 

The remainder of this paper is organized as follows. After introducing 
some basic notations, concepts and the relevant aspects of the quantum 
formalism in Section [21 we present the main results in Section [3l Section [4] 
is dedicated to the proofs. At last, in Section [5l we conclude the paper with 
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some discussion and open questions. 


2 Notation and preliminaries 

Let B{T-L) denote the set of linear operators on a complex, finite-dimensional 
Hilbert space Let V{'H) C B{'H) be the set of positive semidehnite 
matrices, and 'D{'H) := {cu : a; € and Tr a; = 1} is the set of density 

matrices. We say a matrix A> 0 if A £ and A > B A — B > 0. The 

dimension of the Hilbert space Ti is denoted as iTfl. 1 denotes the identity 
matrix. We use the Dirac notation ju) £ T-L to denote a unit vector, (u| its 
conjugate transpose, and (u|rc) the inner product. A Hermitian matrix X 
can be written in the spectral decomposition form: X = XiQi, where Aj’s 
satisfying Aj 7^ Xj for i / j are the eigenvalues, and Qj’s satisfying QiQj = 
SijQi and YliQi ~ ^ orthogonal projectors onto the eigenspaces. 

supp(A) is the supporting space of X and is spanned by all the eigenvectors 
with non-zero eigenvalues, {X > 0} := Qi represents the projector 

onto the positive supporting space of X, and D(X) := |{Ai}i| denotes the 
number of eigenspaces, or distinct eigenvalues. For a subspace S C B, 
proj(S') is the projector onto S. The sum of two subspaces 81,82 C B, is 
dehned as 81 + 82 '■= {u -|- v\u £ 81 , v £ 82 }- When we say the overlap 
between two subspaces Si and 82 , we mean the maximal overlap between 
two unit vectors from each of them: max{|(u|t(;)| : ju) £ 81 , jic) £ 82 }- 

We briefly review some aspects of the quantum formalism, relevant in 
this paper. Every physical system is associated with a complex Hilbert 
space, which is called the state space. The states of a system are described 
by density matrices. Pure states are of particular interest, and are rep¬ 
resented by rank-one projectors, or simply the corresponding unit vectors. 
Throughout this paper, we are concerned with quantum states of a hnite 
system, associated with a hnite-dimensional Hilbert space. A density ma¬ 
trix oj can be decomposed as the sum of an ensemble of pure states, that 
is, ui = with {pi} a probability distribution. An intuitive 

understanding is that pure states represent “deterministic events”, and a 
density matrix is the quantum analogue of a probability distribution over 
these events. However, note that this decomposition is not unique, and 
non-orthogonal pure states are not perfectly distinguishable. 

The procedure of detecting the state of a quantum system is called quan¬ 
tum measurement, which, in the most general form, is formulated as positive 
operator-valued measure (POVM), that is, A4 = {Mj}*, with the POVM el¬ 
ements satisfying 0 < Mi < 1 and '^iMi = 1. When performing the 
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measurement on a system in the state w, we get outcome i with probability 
Tr(ti;Mj). Projective measurements, or von Neumann measurements, are 
special situations of POVMs, where all the POVM elements are orthogonal 
projectors: MiMj = 5ijMi, with 6 ij the Kronecker delta function. 

Suppose a physical system, also called an information source, is in one of 
a finite set of hypothesized states {lji ,..., }, with a given prior {pi,,pr}. 

For convenience, we denote them as a normalized ensemble {Ai := piLOi,... ,Ar := 
PrUJr}. To determine the true state, we make a POVM measurement {Mi,..., Mr}, 
and infer that it is in the state Ui if we get outcome i. The average (Bayesian) 
error probability is 

r 

Pe{{Au...,Ar};{M,,...,Mr}) (1) 

^=1 

Minimized over all possible measurements, this gives the optimal error prob¬ 
ability 

P: ({Ai, ..., Ar}) := - Mi): POVM (Mi,..., Mjj . (2) 

We note here that the definitions 0 and m apply, as well, to a non- 
normalized ensemble of quantum states which only satisfies 

the constraint (Vi) Ai > 0. In this case, Pg and Pg may not have a clear 
meaning but sometimes can be useful. 

In the asymptotic setting where w* is replaced by the tensor product 
state /of’"', we are interested in the behavior of the optimal error P*, as 
n —>■ oo. An important quantity characterizing this asymptotic behavior, is 
the rate of exponential decay, or simply error exponent 

liminf — logP^* {{pipT, • • • ,PrpT}) ■ 

n—^oo 71 


3 Results 

Our main result is the following Theorem [TJ Recall that, for the case r = 2 
of testing two hypotheses, it has been proven nearly a decade ago in 2006; 
see [U and [29] . 

Theorem 1. Let {pi ,..., pr] be a finite set of quantum states on a finite¬ 
dimensional Hilbert space H. Then the asymptotic error exponent for testing 
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{pf”,..., pf"'}, for an arbitrary prior {pi,... ,pr}, is given by the multiple 
quantum Chernoff distance: 

lim — log Pf {{pipf^, ..., PtpT}) =, min max |- log Ti' ptp]~"} • (3) 

n-i>oo n 0 <s<l ■' J 

The optimality part, that is, the left-hand side of equation ([3|) being 
upper bounded by the right-hand side, follows easily from the optimality of 
the binary case r = 2 [29]; see the argument in m- Roughly speaking, this 
is because, discriminating an arbitrary pair within a set of quantum states is 
easier than discriminating all of them. On the other hand, the achievability 
part is the main difficulty in proving Theorem [TJ In [T] , Audenaert et al em¬ 
ployed the Holevo?Helstrom tests ({pf”' — pf"' > 0}, 1 — {pf"' — pf"' > 0}) 
to achieve the binary Chernoff distance in testing pf^ versus pf”- However, 
to date we do not have a way to generalize the method of Audenaert et al to 
deal with the r > 2 cases, even though there is the multiple generalization 
of the Holevo?Helstrom tests [201 HU; see discussions in [32| and [2] on this 
issue. Here, using a conceptually different method, we derive a new upper 
bound for the optimal error probability of equation ([2|). This one-shot er¬ 
ror bound, as stated in Theorem [2l works for testing any finite number of 
finite-dimensional quantum states, and when applied in the asymptotics for 
i.i.d. states, accomplishes the achievability part of Theorem [1] 

Our method is inspired by the previous work of Nussbaum and Szkola |32j . 
It is shown in [32] that if the supporting spaces of the hypothetic states 
pi,...,Pr are pairwise disjoint (this means that the supporting spaces of 
pf”,..., pf^ are asymptotically highly orthogonal), then the Gram-Schmidt 
orthonormalization can be employed to construct a good measurement, 
which achieves the error exponent of Theorem [H Here to prove Theorem [1] 
for general hypothetic states, we find a way to remove a snbspace from 
each eigenspace of the states pf ”,..., p®”. Then we show that, on the one 
hand this removal will cause an error that matches the right-hand side of 
equation ([3]) in the exponent, and on the other hand the pairwise overlaps 
between the supporting spaces of pf”,... ,p®” are made sufficiently small, 
such that the Gram-Schmidt orthonormalization method is applicable. For 
the sake of generality, we will actually realize these ideas for general non¬ 
negative matrices Ai,..., yielding the following Theorem [2] 

Theorem 2. Let Ai,...,Aj. € V{'H) be nonnegative matrices on a finite¬ 
dimensional Hilbert space H. For all 1 < i < r, let Aj = Ylk=i \kQik be the 
spectral decomposition of Ai, and write T := max{Ti,... ,Tr}. There exists 
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a function f{r,T) such that 

P:{{Au...,Ar})<f{r,T) ^ '^mm{X^k,Xji}TiQikQje (4) 

k,i 

and we have f{r,T) < 10(r — 

Our upper bound of equation (jH), up to an r- and T-dependent fac¬ 
tor, coincides with the multiple-state generalization of the lower bound of 
Nussbaum and Szkola |29] . To see this, using the result in [35], we easily 
generalize the bound obtained in [29] and get 

Pf {{Ai,... ,Ar}) > ^ '^mm{Xik,Xje}TvQikQji. (5) 

In the case that r = 2, it is interesting to compare equation ([3|) with the 
upper bound of Audenaert et al [T]: 

P;({Ai,A 2 })< min TrA^Ai-T (6) 

0<s<l 

While we see that our bound is stronger, in the sense that 

^minjAifc, A2£}TrQifc(52£ < min TrAJA^”^ (7) 

k,i - 

is always true, we also notice that it is weaker because it has an additional 
multiplier depending on the number of eigenspaces of the two states. 

4 Proofs 

This section is dedicated to the proofs of Theorem[T]and Theorem[2j At first, 
we present a definition and some necessary lemmas in Section 14. 11 Then we 
construct the measurement for discriminating multiple quantum states in 
Section 14.21 Using this measurement, we prove Theorem [2] in section 14.31 
At last, built on Theorem [2l Theorem [1] will be proven in Section 14.41 

4.1 Technical preliminaries 

We begin with the definition of the operation “e-subtraction” between two 
projectors or two subspaces. This operation, say, for two subspaces and 
S 2 -, reduces the overlap between them by removing a subspace from Si, actu¬ 
ally, in the most efficient way. It will constitute a key step in the construction 
of measurement. 
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Definition 3 (e-subtraction). Let Si, S 2 be two subspaces of a Hilbert space 
H. Let Pi,P 2 G V{H) be the projectors onto Si and S 2 , respectively. Write 
P 1 P 2 P 1 in the spectral decomposition, P 1 P 2 P 1 = with Qx being 

orthogonal projectors and ~ Por 0 < e < 1 , the e-subtraction of 

P 2 from Pi is defined as 


Pi ©e P 2 '■= Pi — ^ Qx- ( 8 ) 

x-.Xx^e^ ,Xx^Q 

Accordingly, the e-subtraction between subspaces is defined as 


Si Be S 2 := SUpp(Pi Qe P2)- 


(9) 


Note that in equation Q the constraint A^; 7 ^ 0 makes sense only when 
e = 0. The following lemma states some basic properties of the e-subtraction. 


Lemma 4. Let Si,S 2 he two subspaces of a Hilbert space H. Let Pi,P 2 G 
V{H) be the projectors onto Si and S 2 , respectively. Write S[ = Si 0 ^ S 2 , 
and P{ = Pi ©e P 2 . Then 

1. S'l is a subspace of Si; P{ is a projector, and 0 < P{ < Pi. 

2 . ^o.s bounded overlap with S 2 : 


max 

|'ui)eSbh2)GS’2 


|(?;i|u 2 )| < e, 


where the maximization is over unit vectors (ui|ui) = (U 2 IU 2 ) = 1. 

3. For 0 < e < 1, we have 


TiiPi-P[) < \tvPiP2 . 


Proof. Let P 1 P 2 P 1 = Yhx^^Qx be the spectral decomposition of P 1 P 2 P 1 , 
with 0 < Aa; < 1 . 

1. Obviously, supp(PiP 2 Pi) C Si. Thus the following three projectors 
satisfy 

Qx< Y. ( 19 ) 

x-.Xx'>e^ ,XW^ 

This, together with Definition [3l implies that P( is a projector and satisfies 
0 < P( < Pi- The fact that S[ is a subspace of Si, follows directly. 
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2. It follows from equation (jlOp and Definition [3] that we can write S[ as 



That is to say, S[ is the direct sum of the eigenspaces of P 1 P 2 P 1 with 
corresponding eigenvalues in the interval (0, e^), together with a subspace of 
the kernel of PiP 2 Pi- So, for any unit vectors jui) G \v 2 ) G 82 -, 

|(^^l|t’2)| = a/ {vi\v2){v2\vi) < a/ {vi\P2\vi) = \/{vi\PiP2Pl\vi) < 6. 


3. This inequality can be verified as follows. 


TV(Fi-p;)=ri' Y. 


<Tt 

= \TrPiP2Pi = 4TTrPiP2- 


□ 

Lemma El and Lemma [6] below are basic results for subspaces of an inner 
product space. 

Lemma 5. Let V, W be subspaces of a Hilbert space H, and have direct-sum 
decompositions V = ©£i and W = ©Jii Wj. Suppose we have 

max < d, for alll <i < Ti, 1 < j < T 2 . 

\v)&C^^\w)£Wj 

Then the overlap between V and W is bounded as 

max < \/TiT2 (5. 

IDGLI«'>Giy 

Proof. Let |u) G V and |t(;) G VL be any two unit vectors. Write |u) = 
and Itc) = Ylj/3j\wj), with \vi) G Vi,\wj) G Wj and = 
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~ use of the Cauchy-Schwarz inequality, we have 








i=l j=l 




and this finishes the proof. 


□ 


Lemma 6. Let Si, S 2 , ■ ■ ■, Sr be subspaces of a Hilbert space, such that the 
overlaps between them are bounded as 

max |(uj|u,)| <6, 1 < i ^ j < r. 

“ - 7-J- 

For alll <i <r, denote the projector onto Si as Pi, and denote the projector 
onto S = Si + S 2 + ■■■ + Sr as P. Suppose 6 < 2 {r-i) ' 



( 11 ) 


Proof. For an arbitrary unit vector |u) G S, write |u) = Yli=i ^ with 
\vi) € Si. Then 


r 


{v\Y,Pk\v)= Y. aj{vi\Pk\vj) + E E aiaj{vi\Pk\vj) 


k=l 


{i,k)-.i=k j 


* ijL)-j=k 



{i,j,k)-. 


k i:i^k j\j^k 


i=j=k 


r 


r 


2 {v\v) -Y\^i 


^ + Y d'^) “ “fclufc)). 


k=l 
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The last term is nonnegative. Besides, we have 


1 ^ ^ aiaj{vi\vj) 



2=1 


IV 

M- 

Wi?- Y. 

\ai\ ■ \aj \ ■ \{vi\vj)\ 

i=l 



Al 

Wi?- Y 

( \aj\‘^) 5 

2 

2=1 



= (1- 

(r - 1)5) Y 
2=1 



Combining the above arguments, we get 

k=l ' 

which implies equation (llip . 


_ 1 - 2(r - 1)(5 
— (r — 1)(5 1 — (r — 1)5 ’ 


□ 


4.2 Construction of the measurements 

In the following, we will describe the procedure of constructing a family of 
projective measurements {ni(e),..., nj._i(e), nj.(e) + n,.+i(e)}, which will 
be used to show that the right-hand side of equation Q is an achievable 
error probability. 

Our construction is similar to the ones explored in [311 M and ear¬ 
lier in [21], especially, in applying the Gram-Schmidt orthonormalization 
to states that are ordered according to the corresponding eigenvalues, for 
formulating the projective measurements. However, our method is also sig¬ 
nificantly different from those of [m [311 [32]. At first, instead of dealing with 
every eigenvector of the hypothetic states individually, we treat each of the 
eigenspaces as a whole. This, for i.i.d. states of the form w®”, is reminiscent 
of the method of types mm, from which we indeed benefit. Secondly, we 
carefully remove from each of these eigenspaces a subspace, in a way such 
that the perturbation to the hypothetic states is limited but the overlaps 
between the supporting spaces of them become sufficiently small. As a re¬ 
sult, we can effectively employ the Gram-Schmidt process to formulate an 
approximately optimal measurement. 
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Recall that iov 1 < i < r, Ai = Ylk=i ^ikQik is the spectral decom¬ 
position. Let Sik := supp((5ifc) be the eigenspaces of Ai. From now on, 
we always identify the subscript “ik” with “(i,k)”. So Xik, Qik and Sik are 
also denoted as A(j fc), and respectively. Dehne the index set 

O := Ui=i{(L A;) : A; G N, 1 < A: < Tj}. Now we arrange all the eigenvalues 
{^ik}{i,k)GO ia a decreasing order, and let 5 : {0,1, 2,... , |Ci|} {(0,0)}UCi 

be the bijection indicating the position of each in such an ordering: 


\(i) > \(2) > ••• > Ag(|c)|), (12) 

and ( 7 ( 0 ) = (0,0) is introduced for later convenience. Our construction 
consists of the following three steps. 

Step 1: reducing the overlaps between the eigenspaces. For this purpose, 
we employ the operation e-subtraction to remove a subspace from each of 
these eigenspaces. Let be endowed with a left associativity, that is, 
AQ^B Q^C := {A 0 £ B) ©^ C. Set Qg{o) = 0 and S'g(o) = {0}. We dehne 


Qg{m) 



and 


<55(0), ifm = 0 

Qg{m) ©e Qg{0) ©e Qg{l) ©e ‘ ' ‘ ©e Qg{m—l): if 1 © Ul < |0| 

(13) 


§ J ‘5’g(o), ifm = 0 

^ \ Sg[rn) ©e 5'g(o) ©e ^^(i) ©£••■©£ if 1 < m < \0\. 

(14) 

Note that, according to Dehnition[3l = supp ( Qgim )) ■ Now we denote 


Ti 

Ai .— ^ ^ XikQikj 1 © i © r. (Ih) 

k=l 


We will show later that, for the purpose of the current paper, Ai is a good 
approximation of Ai. 

Step 2: orthogonalizing the eigenspaces. To formulate the projective 
measurement, we need to assign each of the states {Ai}i an orthogonal 
subspace, for the projectors to be supported on. To do so, we treat 74 j’s 
as representives of Aj’s, and orthogonalize the subspaces {<Sg(m)}m) using a 
Gram-Schmidt process. Dehne <Sg(o) := {0}, and for all 1 < m < |0|, 

Sg(m) ■= (Sg(0) H-h *§g(m)) ©1 (Sg{0) H-h m—1) j ; (16) 
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where 0i is the operation “e-subtraction” with e = 1. Recalling Definition [3l 
we easily see that the right-hand side of equation (I16p is just the orthog¬ 
onal complement of <S'g(o) -f • • • -f Sg^m-i)j in the space < 53 ( 0 ) -f • • • -f 
noticing that obviously the former is a subspace of the latter. So the sub¬ 
spaces <Sg(i), ■ ■ ■, ‘5g(m) are mutually orthogonal. Thus the definition in equa¬ 
tion (fT6|) is equivalent to 


4(1) -^ 4(m) = 0 4(i)’ 1 < m < \0\. (17) 

t=i 

Note that it is possible that Sg{jn) = {0} for certain values of m, and in 
these cases we have proj ^4(m)) ~ 

Step 3: defining the family of projective measurements. We set 

Ti 

ni(e) := ^proj , 1 < i < r, (18) 

k=l 

and let 

r 

nr+i(e) := l - ^ni(e). (19) 

Here the parameter e is introduced in step 1. By definition, ni(e),..., nr+i(e) 
are orthogonal projectors and X]i4ni(e) = I- So, they form a projective 
measurement. Our strategy for testing Ai,..., Ar is that, if we get the mea¬ 
surement outcome associated with n.j(e), we conclude that the state is A^. 

For the outcome associated with the extra projector nr+i(e), we can make 
any decision, or just report an error; here we simply assign it to A^. Thus, 
the family of measurements that we construct for use is 

n = {ni(e),..., nj._i(e), nj.(e) -i- nj.+i(e)}. ( 20 ) 

4.3 Proof of the one-shot achievability bound: Theorem [2] 

We show that, for properly chosen e € [0,1], the measurement constructed 
in Section 14.21 will achieve an error probability that equals the right-hand 
side of equation (14|) . 

Proof of Theorem [M For the ensemble of nonnegative matrices A = {Hi,..., Hj.}, 
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and the measurement IT specified in equation ([20|) , we have 


r—1 

Pe{A-,Tl) =^Tr (1 - ni(e)) + TiAr (1 - n^(e) - nr+i(e)) 

i=l 

r 

<^Tr^i(l-ni(e)). 

1=1 

We now make use of the matrices {Ai}, which are defined in step 1 of the 
measurement construction in section 14.21 cf. equantion (IlSp . Substituting 
{Ai—Ai)+Ai for Ai, and noticing that it is an obvious result of equations (fT^ 
and (|15p that Ai — Ai> 0, we further bound the error probability as 

r r 

Pe{A■,U)<J2MA^-A,) + Y.^TA{t-Ui{e)). ( 21 ) 

i=l i=l 

In the following, we will evaluate the two terms of the right-hand side of 
equation (l2Tp . separately. 

Invoking the spectral decomposition of Ai, and using equation (fTSl) . we 
can write 


r r Ti 

ETr(^*-^*)=EE ^ik Qik) 


i=l 


i=l k=l 


\o\ 

= E ^9im) Tr 

m=l 


(^Qg(m) Qg(m) 


( 22 ) 


where we have used the map g, introduced in the previous section, to indicate 
the subscripts. For each integer 2 < m < \0\, applying the third result of 
Lemma 0] to the e-subtraction formulas 


{Qg{m) Qe Qg{0) ©e Qg{l) ©e ' ' ' ©e Qg(t-1)) ©e Qg(t) j 1 < t < m — 1, 
we obtain 

Tr {Qg{m) ©e Qg{0) ©e ' ‘ ‘ ©e Qg{t-1)) ~ Tr {Qg(m) ©e Qg{0) ©£-••©£ Qg{t)) 
— ^ Tl {C^g[rri) ©£ Qg{0) ©£ ' ' ' ©£ Qg{t—1)^ Qg{t) 

< ^ Tr Qg(m)Qg{t): 

(23) 
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where for the last inequality we have used repeatedly the first result of 
Lemmadl Summing over t G {1, 2,..., m — 1} at both the first and the last 
line of Equation (l23]l , yields 

Qg[m) Tr (^Qg{rn) ©e Qg{0) ©e Qg{l) ©e ‘ ' ‘ ©e Qg(m—1)) 

1 ^ (24) 

t=i 

for all 2 < m < \0\. Combining equations (fT^ and ((Ml) , and directly 
verifying the case m = 1, we get 

Tr {Qg{m) - Qg{m)) ^ ^ Qg{m)Q 9 {t): 1 < m < |0|. (25) 

^ t=0 

Eventually, inserting equation (12^ into equation (12^ . and changing the 
subscripts, we arrive at 


r ^ IC’I m—1 

^^Tr(^j — Ai) <-^ ^g{m)TA Qg{m)Qg{t) 

i=l m=l t=0 



E E min{Ajfc, Xje} Tr QikQji- 

(i,j):i<j k,e 


(26) 


Now we evaluate the second term of the right-hand side of equation (I2ip . 
Using equations m and (fT8]l . and employing the map g to indicate the 
subscripts, we can write 


r r Ti f ^^ / \\ 

^TVi,(l-ni(e)) -^ifcTrQifc I 1 ^ ^ P^oj y^iij I 

i=l i=l k=l \ £=1 / 

r Ti 

SEE \k Tr Qik - proj (^Sik^ ^ (27) 

i=l k=l 

\o\ 

= ^ \(m) Tr(5g(m) — proj (^Sg(^rn)^'^ ■ 

m=l 


Equation (fT7)l implies that 

m 

SUpp ^Qg(^m)^ ^g{m) — CD ' 

t=l 
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As a result, the identity matrix in the third line of equation (1271) can be 
replaced by proj (4(i)) • This gives 


|o| 


^Tr ij (1 - ni(e)) < ^ Ag(m) 


m—1 

TrQsM proj 
t=o 


| 0 | 

?n=l 


^m —1 

E 

^ t=o 


S. 


9{t) 


(28) 


where we have <Sg(o) = ^g{o) — {OIj rind for the equality we used equa¬ 
tion (fT7|l . The next step is to upper bound proj *^ 9 ( 4 )) > rvith a 

quantity in terms of Qg{t)- This can be done by directly applying 

Lemma [6l However, we notice that, for each 1 < i < r the subspaces in 
the set {Sik}k are orthogonal and we can make use of this fact to derive 
a tighter bound. For a pair of numbers {x,y), let [(x,?/)]i denote the first 
component: [(x,y)]i = x. We write 

m—1 r 

EA«) = EE. with ST := 0 4(0- (29) 

t=0 i=l t:0<t<m—l, 

[git)]i=i 

We will use Lemma [5] to bound the overlaps between each pair of the sub¬ 
spaces and then we apply Lemma [H Although we will only get a 

slightly better bound, compared to applying Lemma [6] directly, it is possible 
to make this improvement bigger by strengthening the result of Lemma [6l 
Now, due to Lemma [4] and the dehnition of S'g(t) [cf. equation (fT4|l ]. we can 
bound 

max ^ r, 0 <t' <t < \0\. 

Since is a subspace of <S'g(t'), it follows that 

max _ I (u|u')| < e, 0 < t ^ t' < \0\. 
h)G>Sg(t),b')G>5g(t/) 

Recalling the spectral decomposition of each Aj, we notice that the direct 
sum in equation (1291) has at most Tj terms. So, an application of Lemma [5] 
gives us that, for every 1 < m < PI, 


max K^’P)! < Te, I < i j < r, (30) 
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where T = max{ri,... ,Tr}. Equation (1301) allows us to apply Lemma[6]and 
obtain 

( m-l \ If 

S h(.) 1 £ s <3»«) • 1 £ ” £ loi> (31) 

for which we have also used equation ilUUli and the fact that proj — 

Qg(t)- Note that here the condition on 5 in LemmaElis satished for our later 
choice of e. Now, inserting equation (ISTI) into equation (l28t) . and making use 
of the relation Qg(m) ^ Qg{m) for all 0 < m < |0|, we arrive at 

r 1^1 

^Tru4j (1 — n.j(e)) < ^ \("i) ^ Qgim)Qgit), (32) 

which translates to 

^Trii (1 - ni(e)) < ^ ^ min{Aifc, Tr QjfcQjf. 

i=l {id)-i<j 

(33) 

Eventually, inserting equations (f26]l and (f33]l into equation (1211) and set¬ 
ting e = lets us obtain equation Q, with 

f^r,T) = ~ + 3 < 10(r-l)2r2, 

and we are done. □ 

4.4 Proof of the error exponent 

We are now ready for the proof of Theorem [1] 

Proof of Theorem{^ For the achievability part, we use Theorem [2j Let 
d = \T-L\ be the dimension of the associated Hilbert space of the states 
pi,..., Pr- The type counting lemma (see, e.g. [TO], Theorem 12.1.1) provides 
that the number of eigenspaces of the states pf ”,..., pf^ satishes 

P (pD < (n + 1)^ V 1 < i < r. 

For all 1 < i < r, let pf” = ^[k^Qik^ be written in the spectral decom¬ 
position. Theorem [2] gives 

<10(r-l)^(n + l)“ ^ (31) 

{i,j)-i<j k,l 
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Furthermore, for any 1 < i < j < r, we have 

k,i 

<maxfe,pj}^mm^^ (AS;;y(A5.;y""TVQS;;)Q5.;) (35) 

- k,£ 

= max{pi,pj} min (Trplp]-^) . 

0 < 5<1 \ J 


Inserting equation ()35l) into equation ()34l) . together with some basic calculus, 
results in 


Pl{{p,pT^---^VrpT}) 

<10(r —l)^C'^(n + 1)^'^ max{pi,... ,pr} max min (Tt plp\~‘^ 

{i,j)-i¥^j 0<s<l V 


(36) 


Where = '’^^2 is a binomial coefficient. Prom equation ()36p we easily 
derive 


lim inf 

n —>00 


-1 


n 


logPg* {{pipf'^,...,Prpf 


’■}) > min max 
^ o<s<i 


|-logTrpfpJ 


(37) 


On the other hand, the optimality part, that 

limsup — logPg* ({pipf",... ,Prpf"'}) < min max logTr , 

n^oo n 0<s<l t J 

(38) 

is a straightforward generalization of the r = 2 situation |29] ; see [31] for the 
proof. Alternatively, one can start with the one-shot bound of equation ([5]). 
Then we use the fact that equation ([7]), when applied to the i.i.d. states and 
acted by log” at both sides, becomes asymptotically an equality. Note 
that this is still based on the results of Nussbaum and Szkola in [29] . 

At last, equation (137]) and equation (1381) together are obviously equivalent 
to equation ([3]) and we conclude the proof of Theorem [TJ □ 


5 Discussion 

By explicitly constructing a family of asymptotically optimal measurements 
for testing quantum hypotheses {pf"',..., pf^}, we have proven the achiev- 
ability of the multiple quantum Chernoff distance, and eventually estab¬ 
lished that this is the optimal rate exponent at which the error decays. 
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In the nonasymptotic setting, we have obtained a new upper bound for 
the optimal average error probability in discriminating a set of density ma¬ 
trices {Ai,..., Ar}, which satisfy > 0 and are not necessarily normalized. 
Yuen, Kennedy and Lax m derived a formula for the optimal average error: 


P*{Ai,... ,Ar) = Tr — min{Tr Y : X > Ai, i = 1,... ,r} ; (39) 

i 


see also m and [2] for alternative formulations. However, the fact that 
equation (1391) involves an optimization problem itself, makes it difficult to 
apply this formula directly. Our upper bound stated in Theorem [21 though 
looser compared to equation (1391) . has an explicit form and there is a dual 
lower bound as shown in equation ([5]). We thus hope that it will find more 
applications. 

We wonder whether the states-dependent factor /(r, T) can be replaced 
by a constant, or at least can be improved such that it only depends on r 
(see also a similar conjecture made in [2]). While it is possible that we can 
improve Lemma |6| to give a better bound on /(r, T), we do not think that 
this can remove the dependence on T and r. In this direction, the pretty 
good measurement and its variant [lO], both of which achieve an error 

probability lying between P* and 2P*, may be useful tools to try. In fact, 
in Theorem [2] the dependence of our bound on T is not necessary: using 
the argument in [38], we can convert it into a dependence on the relation 
between the maximal and the minimal eigenvalues of the hypothetic states; 
see Proposition [7] below and the proof in the Appendix. This conversion is 
useful when the spectrum of each Ai is sufficiently flat, no matter how big 
the number of their eigenspaces is. 


Proposition 7. For alii = 1,... ,r, let XmaxiAi) be the maximal eigenvalue, 
and XminiAi) be the minimal nonzero eigenvalue of Ai. Denote 


L := max 


log2 


^^max (Ai) 
^min (Ai) 


log2 


‘2‘Xjnax {Aj. 

^min (Ar] 


Then, in Theorem\^ the states-dependent faetor f[r,T) ean be replaced by 
h{r,L) := 40(r - 1)2 l2. 


Another interesting question is how our method can be extended to deal 
with the problem of discriminating correlated states, where each of the hy¬ 
pothetic states ..., can be correlated among the n subsystems. The 
upper bound stated in Theorem [2I (also in Proposition [7| for an alternative 
states-dependent factor), together with the dual lower bound of equation ([5|), 
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can be used to analyse the asymptotic behavior of the error. This method 
may identify the optimal error exponent which can be quite different from 
reasonable generalizations of the Chernoff distance, in contrast to previons 
works which under certain conditions yield the mean quantum Chernoff dis¬ 
tance; see, for example, [izi[23[5ni[3n]. However, the main difficulty we will 
confront in this method is to characterize the spectral decomposition of the 
correlated states when n goes to infinity. At last, a particularly interesting 
problem in this setting, proposed by Audenaert and Mosonyi [2], is testing 
composite hypotheses, say, /o®” versus Here the sum may be re¬ 

placed by an integral. See also discussions in [5] and of this problem 
in the asymmetric case of Stein’s lemma. While our method for proving 
Theorem [1] does shed some light on this problem, it seems that a complete 
solution needs further ideas. 

Acknowledgments. The author is grateful to Charles Bennett, Aram 
Harrow, Graeme Smith, John Smolin and Andreas Winter, for very help¬ 
ful discussions and comments. He further thanks Fernando Brandao, Milan 
Mosonyi, Marco Tomamichel and the anonymous referees for their enthusi¬ 
asm and/or helpful comments. 

Appendix A: proof of Proposition [7] 


Proof. For an arbitrary nonnegative matrix A = XkQk written in the 
spectral decomposition form, define the modified version of A as 

M 

A'=Y, 2^X„un{A) Qk, 

m=l k-.Xk&Om 


where M := 
2^X 


log; 


2At] 


.(A) 


= n{A') and Om := {A/, : 2 ^ ^Xmin{A) < Xk < 


Amin (■^) 

(A)}. Then we have A < A' < 2A, and also A and A' commute. Now 
for Ai,..., A^, we define A'j^,..., A^ in a similarly way as A' was defined. 
Obviously, fJ(A') = logo . Applying Theorem [21 we can evaluate 


<10(r - ■ 4 ^ (^0) 

k,e 


On the other hand, since for all i, Aj < Al, we have by the definition of Pf 
that 

p: ({Ai, ..., A,}) < p: ({A;, ..., A/}) . (41) 

Equations ([40]) and ([41]) together lead to the advertised result. □ 
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